Maithili Text to Speech Corpus
OverView
30:59:20 hours | 19.56 GB | 32260 Audio Segments | 2 SpeakersThe LDC-IL Maithili Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Devanagari script. This dataset spans a ...
Categories
Cart
Account
Search
Recent View
Go to Top
All Categories
×
Request Cart
×
Your request cart is empty!
Search
×
Recent View Datasets
×
Dataset Description
30:59:20 hours | 19.56 GB | 32260 Audio Segments | 2 Speakers
The LDC-IL Maithili Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Devanagari script. This dataset spans a duration of 32:42:20 (hh:mm:ss) , consisting of read speech in the studio setup. The data is derived from 01 female and 01 male native Maithili speakers. A comprehensive explanation of dataset can be found in the Maithili Text to Speech Documentation.
For any research-based citations, please use the following citations:
- Shantanu Kumar, Dinesh Mishra, Saurabh Varik, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Maithili Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-36-1.
- Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.
Item specifics
- Authors Shantanu Kumar, Dinesh Mishra, Saurabh Varik, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
- Corpus Type TTS Corpus
- Catalogue Number 1515
- ISBN 978-93-48633-36-1
- Data Source Studio
- Duration 30:59:20 hours
- # of Audio Segments 32260
- Release Date 20/03/2025
- Terms and Conditions General instructions for use of the resources provided by LDC-IL.
Commercial User
Non-Commercial User
LDC-IL Raw Text Corpora: An Overview
LDC-IL Raw Speech Corpora: An Overview